Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
 
					
				
		
 posywang
		
			posywang
		
		
		
		
		
		
		
		
	
			
		
		
			
					
		Hi, I have two data source and I'd like to merge them but some projects are duplicate. I want to keep the row that has value in "Project Approach" column and get rid of the one with no value, only if there's a duplicate. Sample data and file are attached. Can anyone help? Thanks.
 
					
				
		
Hello!
Try this:
Data:
Load * Inline
[
Project, ProjectDate, ProjectApproach
P123, 01/01/2018, Agile
P235, 03/03/2018, Non-Agile
P353, 06/03/2018, Agile
P353, 06/03/2018,
P235, 03/03/2018,
P432, 03/05/2018,
];
FinalData:
NoConcatenate
Load FirstValue(Project) As Project,
FirstValue(ProjectDate) As ProjectDate,
LastValue(ProjectApproach) As ProjectApproach
Resident Data
Group By Project, ProjectDate
Order By ProjectApproach;
Drop Table Data;
Result:
 isingh30
		
			isingh30
		
		
		
		
		
		
		
		
	
			
		
		
			
					
		You want solution in UI or at script level?
Temp:
LOAD Project,
ProjectDate,
ProjectApproach
FROM
(ooxml, embedded labels, table is Sheet1)where ProjectApproach > 0;
Temp1:
LOAD Project,
ProjectDate,
ProjectApproach
FROM
(ooxml, embedded labels, table is Sheet1)where Project = 'P432';
 
					
				
		
Hello!
Try this:
Data:
Load * Inline
[
Project, ProjectDate, ProjectApproach
P123, 01/01/2018, Agile
P235, 03/03/2018, Non-Agile
P353, 06/03/2018, Agile
P353, 06/03/2018,
P235, 03/03/2018,
P432, 03/05/2018,
];
FinalData:
NoConcatenate
Load FirstValue(Project) As Project,
FirstValue(ProjectDate) As ProjectDate,
LastValue(ProjectApproach) As ProjectApproach
Resident Data
Group By Project, ProjectDate
Order By ProjectApproach;
Drop Table Data;
Result:
 
					
				
		
 posywang
		
			posywang
		
		
		
		
		
		
		
		
	
			
		
		
			
					
		Thank you for the solution! It worked like a charm!
